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WAFER INSPECTION SYSTEM 

CROSS-REFERENCE TO RELATED PATENT APPLICATIONS 

[0001] This application is related to as a continuation-in-part and claims priority from 
copending U.S. Pat. App. Ser. No. 10/631,469 file July 29, 2003 which claims priority from U.S. 
Prov. Pat. App. Ser. No. 60/444,504 filed February 3, 2003. 

FIELD OF THE INVENTION 

[0002] The present invention is directed to methods and systems for the inspection of 
semiconductor wafers and other materials such as integrated circuits (IC) and any surface 
benefiting from inspection. Hereinafter, any material susceptible of surface inspection by the 
system herein described contact potential difference imaging device will be denoted a "wafer". 
More particularly, the present invention is directed to a method and system for the 
characterization of microscopic and macroscopic defects through imaging and visualization of 
the contact potential difference topology on the wafer surface through the use of a non- vibrating 
contact potential difference sensor. 

BACKGROUND OF THE INVENTION 

[0003] The multi-billion dollar global market for semiconductor defect management is growing 
both in absolute terms and as a percentage of semiconductor capital equipment investment. In 
general, there are two factors that determine the economics of a semiconductor fabrication 
facility at a given utilization level, namely throughput and yield. As complex new technologies 
such as 300 mm wafers, copper interconnects, and reduced feature (circuit) sizes drive the 
margin of error in fabrication ever lower, new inspection technologies are critical to keep yields 
high and bottom-line economics attractive. Detection and elimination of chemical contamination 
and other types of defects is a constant concern for semiconductor manufacturers and equipment 
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suppliers. Contamination can arise from use of processing chemicals, processing equipment, and 
poor handling techniques. Contaminants can include, for example, metals, carbon, and organic 
compounds. Other types of defects can result from a wide range of causes, including flaws in the 
semiconductor crystal, improper processing, improper handling, and defective materials. In 
addition, many cleaning steps are required in wafer fabrication, such as but not limited to the 
semiconductor industry. Each step is time consuming and requires expensive chemicals that may 
require special disposal procedures. Existing methods for monitoring or controlling these 
processes are expensive and time consuming. As a result, wafers are often cleaned for a longer 
period of time and using more chemicals than are required. 

[0004] Defect detection and characterization systems can be divided into in-line and off-line 
systems. "In-line" refers to inspection and measurement that takes place inside the clean room 
where wafers are processed. "Off-line" refers to analysis that takes place outside of the wafer 
processing clean room, often in a laboratory or separate clean room that is located some distance 
from the manufacturing area. In addition, many of these analytical techniques are destructive, 
which requires either the sacrifice of a production wafer or the use of expensive "monitor" 
wafers for analysis. In-line inspection and measurement is crucial for rapidly identifying and 
correcting problems that may occur periodically in the manufacturing process. A typical 
semiconductor wafer can undergo over 500 individual process steps and require weeks to 
complete. Each semiconductor wafer can have a finished product value of up to $100,000. 
Because the number of steps and period of time, involved in wafer fabrication are so large, a lot 
of work in process can exist at any point in time. It is critical that process-related defects be 
found and corrected immediately before a large number (and dollar value) of wafers are affected. 
Such defects, regardless of the nature of the wafer, semiconductor, IC, or other device, are 
detrimental to performance and diminish productivity and profitability. 

[0005] Many types of defects and contamination are not detectable using existing in-line tools, 
and these are typically detected and analyzed using expensive and time-consuming "offline" 
techniques (described below) such as Total Reflectance X-ray Fluorescence (TXRF), Vapor 
Phase Decomposition Inductively Coupled Plasma-Mass Spectrometry (VPD ICP-MS) or 
Secondary Ion Mass Spectrometry (SIMS). Since these techniques are used off-line (outside of 
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the clean room used to process wafers) and usually occur hours, or even days, after the process 
step that has caused the contamination, their value is significantly limited. 
[0006] A brief description of some well known techniques for wafer inspection and chemical 
contamination detection are presented in Table 1. This list is not in any sense exhaustive as there 
are a very large number of techniques that are used for some type of semiconductor analysis or 
characterization or for other surface inspection of other types of materials. 

Table 1 



Analytical Technique 


Description 


In-line/Off-line 


Total Reflection X-Ray 
Fluorescence (TXRF) 


X-rays irradiate the wafer within the 
critical angle for total external 
reflectance, causing surface atoms to 
fluoresce. 


Off-line 


Automated Optical 
Microscopy 


Optical images are acquired and 
automatically analyzed for detection 
of large defects. 


In-line 


Laser Backscattering 


Wafer surface is illuminated with 
laser spots and the angle and/or 
polarization of reflected light is 
analyzed to detect and classify 
particles. 


In-line 


Vapor Phase Decomposition 
Inductively Coupled-Mass 
Spectrometry (VPD ICP- 
MS) 


Wafers "scanned" with a drop of HF 
that is analyzed using mass 
spectrometry. 


Off-line 


Secondary Ion Mass 
Spectroscopy (SIMS) 


Ion beam sputters the wafer surface 
creating secondary ions that are 
analyzed in a mass spectrometer. 


Off-line 



[0007] Table 2 summarizes some major advantages and disadvantages of each example 
technique. In general, off-line detection techniques are extremely sensitive to tiny amounts of 



-3- 

011.1179188.1 



Atty. Dkt. No.: 023228-0109 



contamination; but are slow, expensive, and complex to operate. Some have limited, or no, 
imaging or surface mapping capability, or are destructive in nature. In-line techniques are much 
faster, non-destructive, and provide defect mapping, but have limited chemical contamination 
detection or analysis capability. 

Table 2 



Analytical Techniaue 


Advantaaes 


D i oa rl \i a n t r) flP 


Total Reflection X- 
Ray Fluorescence 
(TXRF) 


* Very sensitive 

* Some mapping capability 

* Nondestructive 


* Limited coverage 

* Unpatterned wafers only 


Automated Optical 
Microscopy 


* Fast 

* Relatively low cost 

* Detects a wide range of 
macro defects (>50 
microns) 

* Imaging of wafer surface 

* Non-contact/non- 
destructive 


* Very limited chemical 
and particle detection 


Laser Backscattering 


* Fast 

* Relatively low cost 

* Detects very small 
particles 

* Imaging of water surface 

* Non-contact/non- 
destructive 


* Only detects particles - 
no chemistry 


Vapor Phase 
Decomposition 
Inductively Coupled- 
Mass Spectrometry 
(VPD ICP-MS) 


* Very sensitive 

* Able to identify wide 
range of contaminants 


* Destructive 

* Slow 

* Expensive 

* Complex 

* Cannot image 

* Only works on bare 
silicon 
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Analytical Technique 


Advantages 


Disadvantages 


Secondary Ion Mass 
Spectroscopy (SIMS) 


* Very sensitive 

* Detects a wide range of 
contaminants 

* Sub-surface detection 


* Expensive 

* Slow 

* Destructive 



[0008] In general, existing in-line wafer inspection tools operate at production speeds and 
generate images of the wafer surface that are processed to identify and locate defects. However, 
these techniques are, as mentioned above, very limited in their ability to detect chemical 
contamination. Laser backscattering systems are limited to detecting particles down to sub- 
micron sizes, and optical microscopy systems can only detect chemical contamination that 
results in a visible stain or residue. Both techniques lack the ability to identify or classify the 
chemical composition of the particle or contamination. Off-line laboratory techniques are used 
to qualify the cleanliness of new processes and equipment, or to analyze defects detected by in- 
line equipment or as part of failure analysis. 

[0009] Another system that has been investigated is the use of Contact Potential Difference 
imaging (CPD). CPD refers to the electrical contact between two different metals and the 
electrical field that develops as a result of the differences in their respective maximum electronic 
energy level, i.e. their respective Fermi energies. When two metals are placed in contact, the 
Fermi energies of each will equilibrate by the flow of electrons from the metal with the lower 
Fermi energy to that of the higher. "Vibrating CPD sensor" refers to the vibration of one metal 
relative to the other in a parallel plate capacitor system. The vibration induces changes in the 
capacitance with time, and therefore a signal related with the surface profile. A CPD signal can 
also be generated by the translation of one surface past a reference sample through the use of a 
non- vibrating contact potential difference (nvCPD) sensor(s). This translation makes high speed 
scanning possible. 

[0010] However, even these nvCPD sensors can themselves present certain difficulties. At a 
microscopic level, the surfaces of wafers are not flat due to wafer thickness variation, materials 
on the surface, "bowing", and other factors. In order to scan the wafer at a close but safe (i.e., 
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close to the surface to promote good signal strength but far enough away to minimize any 
possibility of impacting the wafer surface) distance, an appropriate sensor height must be 
calculated and set. Thus, the height of the sensor above the wafer surface must be measured and 
controlled to produce repeatable results. Furthermore, height control is also necessary to 
minimize the sensor height to improve resolution and signal strength. However, height is 
difficult to control and measure, as is the appropriate height for each specific wafer. 
[0011] It is possible to use one of many commercially available height sensors to control the 
height of the nvCPD sensor above the wafer surface. This requires the expense of an additional 
sensor, and the added complexity of a calibration routine to determine the position of the nvCPD 
sensor tip relative to measurements made by the separate height sensor. 
[0012] A related problem is the difficulty in establishing a point of reference for all distance 
measurements, including height, related to an nvCPD scan. A reference point is needed to 
produce useful measurement data for image production. 

[0013] In some sensor systems, such as nvCPD sensors, it is necessary to separate the sharp 
peak signal from the other two components of the signal (low frequency signal and induced noise 
signals) to locate and measure the contaminated areas of a wafer. This is challenging because 
the sharp peak signal behaves like noise, i.e., it consists of sharp peaks that alternate their 
polarity in high frequency mode. Because of this, conventional high frequency filters based on 
the frequency domain only do not work, as they would degrade the sharp peak signal 
significantly along with the noise. 

[0014] In addition, an nvCPD signal is generally delayed in time, which impacts on the quality 
of the nvCPD signal/image. As the sampling time increases, the time delay becomes larger. The 
time delay may be explained by the equivalent RC circuit modeling the electrical signal path 
from the probe tip to the output of the A/D converter through the amplifier, the data acquisition 
board and the connecting lines between them. The equivalent capacitance is mixed with the 
capacitance between the probe and the wafer surface, the parasitic capacitance of the connecting 
lines, the internal capacitance of the amplifier, and other known conventional effects. The result 
is that minute feature signals are less detectable, and the signal magnitude and thus the signal-to- 
noise ratio are smaller. 
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[0015] A critical need therefore exists for a fast, inexpensive, and effective means of detecting, 
locating, and classifying relatively small quantities of chemical content and physical features on 
wafers. In addition, there is a need for a system which minimizes cost and complexity of the 
sensor control mechanisms, such as height control. Furthermore, there is a need for methods and 
systems that have improved signal processing. 

[0016] It is therefore an object of the invention to provide an improved method and system for 
inspection of surfaces of any materials, such as semiconductor wafers and other electronic 
devices. 

[0017] It is an additional object of the invention to provide an improved method and system for 
providing images of surface defects on a semiconductor wafer or an integrated circuit device. 
[0018] It is yet another object of the invention to provide an improved method and system for 
identifying different classes of wafer surface defects by pattern recognition. 
[0019] It is still a further object of the invention to provide an improved method and system for 
classifying categories of surface defects on semiconductor wafers, including without limitation 
surface defect states, electrostatic field variations, oxide states and chemical contamination. 
[0020] It is also an additional object of the invention to provide an improved method and 
system for sensing electrostatic fields arising from semiconductor wafer surface defects. 
[0021] It is yet another object of the invention to provide an improved method and system for 
detecting the presence of thin dielectric films on surfaces of semiconductor wafers and to detect 
film defects such as pinholes, bubbles, delaminations, or contamination under the film. 
[0022] It is a further object of the invention to provide an improved method and system to 
sense variations in oxide states on semiconductor wafer surfaces. 

[0023] It is also a further object of the invention to provide an improved method and system to 
classify particulate contaminants on semiconductor wafers identified initially by optical 
inspection systems. 

[0024] It is yet a further object of the invention to provide an improved method and system for 
detecting variations in dopant concentration of semiconductor wafers. 

[0025] It is another object of the invention to provide an improved method and system for use 
of an nvCPD sensor to inspect the surface quality of semiconductor wafers. 
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[0026] It is still another object of the invention to provide an improved method and system of 
nvCPD sensors in combination with other inspection systems for evaluating semiconductor 
wafer surface properties. 

[0027] It is a further object of the invention to provide an improved method and system for 
producing topological images of differing contact potential characteristic of defects on a 
semiconductor wafer. 

[0028] It is also an object of the invention to provide an improved method and system for 
rapidly scanning the surface of a semiconductor wafer to identify sub-microscopic, microscopic 
and macroscopic surface defects characterized by potential field disturbances on the wafer 
surface. 

[0029] It is also an object of the invention to provide an improved method and system for 
detecting the cleanliness of a semiconductor wafer to determine if a cleaning process has 
eliminated all contaminants and to avoid the time and expense of cleaning wafers for longer than 
is necessary to remove contaminants or to perform unnecessary processing steps. 
[0030] In each case described above, wafer surface can refer to the front-side (patterned side) 
of the wafer, back-side (unpatterned side) of the wafer, or the edge of the wafer or any surface 
undergoing inspection regardless of the type of material. 

[0031] Other objects, features and advantages of the present invention will be readily apparent 
from the following description of the preferred embodiment thereof, taken in conjunction with 
the accompanying drawings described below. 

SUMMARY OF THE INVENTION 

[0032] The present invention provides a wafer inspection system that is a fast, inexpensive, and 
effective means of detecting, locating, and classifying relatively small quantities of chemical 
content, and physical features on wafers, such as but not limited to semiconductor production, 
integrated circuit devices, or any material which may benefit from such inspections, while 
allowing for a minimization of the complexity of the sensor control mechanisms and an 
improvement in signal processing. In one preferred example embodiment, a wafer inspection 
system of the present invention includes steps for identifying a defect on a surface of a 
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semiconductor wafer. In a preferred embodiment, the steps comprise providing a semiconductor 
wafer; providing a non- vibrating contact potential difference sensor; scanning the semiconductor 
wafer relative to the non- vibrating contact potential difference sensor; generating contact 
potential difference data from the non- vibrating sensor; and processing the non- vibrating contact 
potential difference sensor data to automatically detect a pattern that is characteristic of a 
particular type of defect. 

[0033] In addition, the system of the present invention provides, in a preferred embodiment, a 
method for determining a reference point for the sensor. In addition, in some embodiments of 
the present invention the system includes a method for determining the height of the sensor. In 
addition, the present invention may preferably include a method for calculating the scan height to 
allow for wafer height variation. Furthermore, a system in accordance with the principles of the 
present invention preferably includes signal processing methods and devices for improving the 
native signal output of the sensor, such as by reducing noise or reducing signal time delay. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0034] FIG. 1 illustrates one embodiment of the nvCPD scanning method and system; 
[0035] FIG. 2 illustrates the concept of the contact potential difference methodology; 
[0036] FIG. 3 illustrates an nvCPD scanning method; 

[0037] FIG. 4 illustrates the current output of an nvCPD probe as it passes over a positive and 
negative work function transition; 

[0038] FIG. 5 illustrates axial orientation of the nvCPD system; 

[0039] FIG. 6 illustrates standard deviation of signals within a scan area; 

[0040] FIG. 7 illustrates steps for creating test wafers which are partially coated with known 

concentrations of contaminants; 

[0041] FIG. 8 A illustrates one form of scanning nvCPD system with a three axis linear 
positioning system with the nvCPD sensor and a wafer mounted on a high speed spindle; and 
FIG. 8B illustrates another form of scanning nvCPD system; 

[0042] FIG. 9 illustrates a flow diagram for the image acquisition process of a radially scanned 
nvCPD imaging system; 
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[0043] FIG. 10A illustrates an optical image of a 100mm diameter silicon wafer after 
application of a vacuum pick-up device and FIG. 10B illustrates an nvCPD image of the wafer of 
FIG. 10A; 

[0044] FIG. 1 1 A illustrates an optical image of a second silicon wafer after applying alcohol 
while spinning the wafer and allowing the alcohol to dry and FIG. 1 IB is an nvCPD image of the 
same wafer of FIG. 11 A; 

[0045] FIG 12A illustrates an optical image of a silicon wafer after application of a latex glove 

mark and FIG. 12B is an nvCPD image of the same wafer of FIG. 12 A; 

[0046] FIG. 13A illustrates an optical image of a silicon wafer having human fingerprints on 

the wafer and FIG. 13B illustrates an nvCPD image of the wafer of FIG. 13 A; 

[0047] FIG. 14 illustrates an nvCPD image of a silicon wafer after brushing the wafer surface 

with a stainless steel tool; 

[0048] FIG. 15 illustrates an nvCPD image of a silicon wafer after pressing an aluminum 
fixture onto the wafer surface; 

[0049] FIG. 16 illustrates a chart depicting a typical nvCPD signal where there is a set of peaks 
comprising a positive peak and a negative peak having non-equivalent heights; 
[0050] FIG. 17 illustrates a chart depicting a signal output of one embodiment of the present 
invention where the positive peak height is substantially equivalent to the negative peak height; 
[0051] FIG. 18 is a detailed view of the Adjust Starting Position and Height of Probe Above 
Surface step of FIG. 9; 

[0052] FIG. 19 illustrates NCVPD processed wafer images before deconvolution; 

[0053] FIG. 20 illustrates NCVPD processed wafer images after deconvolution; 

[0054] FIG. 21 A illustrates a wafer map produced in accordance with the principles of the 

present invention, wherein the wafer pattern is one atomic layer thick over native silicon oxide; 

[0055] FIG 2 IB is a graph showing signal strength along a single probe track; 

[0056] FIG. 21 C is a graph of the signal strength versus the density of gold for the wafer map 

depicted in FIG. 21 A; 

[0057] FIG 22A is a 2D Edge Detection optical view using Canny Algorithm at Multiple 
Resolutions (#7 Wafer dipped into a CMP Slurry); 
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[0058] FIG 22B is a 2D Edge Detection image produced in accordance with the principles of 
the present invention using Canny Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold=0.00001, Contamination Level=24.5); 

[0059] FIG 22C is a 2D Edge Detection image produced in accordance with the principles of 
the present invention using Canny Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold =0.008, Contamination Level =4.5); 

[0060] FIG 22D is a 2D Edge Detection image produced in accordance with the principles of 
the present invention using Canny Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold=0.01, Contamination Level =1.9); 

[0061] FIG 22E is a 2D Edge Detection image produced in accordance with the principles of 
the present invention using Canny Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold=0.012, Contamination Level =1.1); 

[0062] FIG 22F is a 2D Edge Detection image produced in accordance with the principles of 
the present invention using Canny Algorithm at Multiple Resolutions (#7 Wafer dipped into a 
CMP Slurry, threshold=0.014, Contamination Level =0.8); 

[0063] FIG. 23 A an optical image of 2D Edge Detection produced in accordance with the 
principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry); 

[0064] FIG. 23B an optical image of 2D Edge Detection produced in accordance with the 

principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 

dipped into a CMP Slurry, threshold=0.00001, Contamination Level =24.3); 

[0065] FIG. 23C an optical image of 2D Edge Detection produced in accordance with the 

principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 

dipped into a CMP Slurry, threshold=0.005, Contamination Level =9.6); 

[0066] FIG. 23D an optical image of 2D Edge Detection produced in accordance with the 

principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 

dipped into a CMP Slurry, threshold=0.006, Contamination Level =8.2); 
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[0067] FIG. 23E an optical image of 2D Edge Detection produced in accordance with the 
principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry, threshold=0.008, Contamination Level =6.9); and 
[0068] FIG. 23F an optical image of 2D Edge Detection produced in accordance with the 
principles of the present invention using Canny Algorithm at Different Scales (Qcept #6 Wafer 
dipped into a CMP Slurry, threshold=0.009, Contamination Level =6.4). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0069] A preferred embodiment of the invention is directed to an improved use of an nvCPD 
sensor. In particular, FIG. 1 illustrates a functional block flow diagram of components and 
operation of one preferred form of an nvCPD scanning system 10. A nvCPD sensor 12 (see 
FIG. 2) is based on the phenomena of contact potential difference which is a voltage generated 
between two dissimilar materials brought in close proximity to each other. An illustration of this 
concept can be seen in FIG. 2. In the case of the wafer scanning system 10, the sensor tip 13 
forms a first plate 14 and a wafer 15 having a wafer surface 16 forms a second plate 18 (see FIG. 
2.) Probe tip surface 20 of the first plate 14 is made of a conducting material with a fixed work 
function - generally, the difference in energy between the Fermi level of the solid and the free 
energy of the space outside the solid, including, in metals, the image potential of electrons just 
outside the surface. The wafer surface 16 of the second plate 18 has a work function which can 
vary due to irregularities in the semiconductor wafer surface 16 or contaminants or other 
materials deposited on the wafer surface 16. When the first plate 14 and the second plate 18 are 
electrically connected, the Fermi levels of the respective surface equilibrate and form an electric 
field between them. If the work function of the sensor tip 13 is fixed, the magnitude of the 
electric field is then related to the distance between the first plate 14 and the second plate 1 8, the 
relative dielectric between the first plate 14 and the second plate 18 and the work function of the 
wafer surface 16. In practice the first plate 14 and the second plate 18 equilibrate rapidly 
providing little to measure. To provide a current flow that can be measured, some motion of the 
sensor tip 12 relative to the wafer surface 16 must be realized. In one embodiment, the nvCPD 
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sensor 12 is moved over the surface at a substantially fixed distance and variations in the wafer 
surface 16 cause a current to flow. 

[0070] An illustration of this concept can be seen in FIG. 3. The current flow from this nvCPD 
sensor 12 can be modeled by the following equation: 



dt dt 



wherein C and Fare defined as 



p p A O — (T) r 



and further wherein s Q is the permittivity of free space, s r is the relative dielectric constant, A is 

the area of the probe tip, d is the distance between the sensor tip 13 and the wafer 15, 0 is the 

work function of the respective surface, and e is the charge on an electron. The Fterm can also 

be described as a difference in surface potentials between the nvCPD sensor 12 and the wafer 15. 

In addition the surface potentials on the wafer surface 16 can vary due to defects. The overall 

surface potential is related to the underlying materials work function but it can also be affected 

by adsorbed layers of material on the wafer surface 16. Even sub mono-layers of materials are 

known to significantly affect the surface potential. 

d V Oz? to be — Qwafer 

[0071] The C — « C — term is related to changes in work function on the 

dt At 5 

wafer surface 16. It can be seen that the magnitude of this term is related to the relative changes 

in work function on the wafer surface 16 and relative speed at which the nvCPD sensor 12 is 

moved over the wafer surface 16. An illustration of the signal generated from this can be seen in 

FIG. 4. Thus, a system in accordance with the principles of the present invention is capable of 

generating one-dimensional signals and two-dimensional images, although three-dimensional 

images can be generated. 

[0072] Many defects can present themselves as variations in the wafer work function or the 
overall surface potential. Both chemical and physical (i.e., geographical) features of the wafer 
surface and the underlying materials can affect the work function of a particular portion or even a 
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single point on the wafer surface; thus, these features can be detected by a sensor in accordance 
with the principles of the present invention. For instance, variation in semiconductor dopant 
concentrations in the wafer 1 5 will cause varying characteristic work functions. In addition, 
other materials that could diffuse into the wafer 15, such as but not limited to copper, will cause 
variations in work function. Within the semiconductor material (or any other material 
susceptible to measurement) itself, mechanical phenomena such as dislocation pile-ups, cracks, 
and scratches generate local stresses which will change the local work function. In addition, 
adsorbed layers of atomic or molecular contaminants even at the sub monolayer level will 
generate appreciable surface potential variations. Particles deposited on the wafer 16 with a 
surface potential different than the surrounding wafer material will also create a signal. Layers 
of chemicals commonly used in the wafer fabrication process will affect the surface potential of 
the wafer. For instance residual CMP slurry or photo-resist would cause local variations in 
surface potential detectable by the nvCPD sensor 12 of the present invention. Such defects and 
chemistry have associated with them characteristic signatures which enable inspection of the 
wafer surface. 

dC 

[0073] The V — term is related to changes in gap between the nvCPD sensor 12 and the 
dt 

wafer 15 or variations in the relative dielectric constant. Geometrical imperfections in the wafer 
surface 16 or particles on the wafer surface 16 would manifest themselves in this component. 
Also because of its differential nature, the magnitude of this component would also increase as 
the relative speed of the nvCPD sensor 12 to the wafer 15 is increased. 

[0074] As previously mentioned, physical or geographical aspects and defects can be imaged 
using a system in accordance with the principles of the present invention. Many classes of wafer 
defects would appear as geometrical changes in the wafer surface 16. In the wafer 15 itself, 
surface cracks, scratches, etched trenches, etc. would be nonlimiting examples of such defects 
causing a geometrical change in the wafer surface and an attendant change in the work function. 
In addition, particles deposited on the wafer 15 would also present themselves as a local change 
in the distance to the probe sensor tip 13. 
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[0075] Variations of dielectric films on the wafer 15 can also be detected. An example would 
be detecting variations in the oxide state grown on the silicon substrate (i.e. SiO, Si02, Si(>3, 
SiCU). In addition, variations in dielectric of other non-conducting materials commonly 
deposited on the wafer could be detected. 

[0076] It should also be noted that many features could present themselves as combinations of 
geometrical changes and chemical changes. For instance, a particle deposited on the wafer 15 of 
differing material than the underlying wafer 15 could cause variation. Also, a crack in the 
surface would also induce stresses that would cause variations in local work function. 
[0077] In FIG. 5 is schematically shown one form of the system 10 for application of the 
nvCPD sensor 12 to scan the wafer 15 for defects and contamination. FIGS. 8 A and 8B also 
illustrate more detailed drawings of two alternative operating embodiments of the system 10. 
The system 10 in FIG. 5 includes an X-Y-Z positioning system 26, a rotating wafer stage 28, a 
high speed data acquisition system 30 with a personal computer (PC) 32, and control software 
executed by the PC 32. 

[0078] As shown in more detail in FIG. 8 A, in one embodiment, the wafer 15 is affixed to a 
rotating spindle or chuck 54 (see FIG. 1) using a clamping fixture 56 on the wafer edges. A 
sensor positioning system 50 includes an nvCPD sensor 52 positioned a fixed distance from the 
wafer 15 is mounted to a spindle 54. The wafer 15 (not seen in this view) is then rotated at high 
speed, and the nvCPD sensor 52 is translated radially to collect data in circumferential tracks. 
The scanning procedure as shown schematically in FIG. 9 lasts between a few seconds and 
several minutes, depending on the number of scanned tracks, the speed of the spindle 54, and the 
speed of the sensor positioning system 52. The tracks of data are then put together to form a 
CPD image. These CPD images allow the visualization of chemical and geometrical defects and 
thereby enable classification of the type of defect present on the wafer surface. Some examples 
of these CPD images can be seen in FIG. 10A-15 and are taken from a 100mm wafer compared 
with optical images of the same wafer (see, Example infra). The present invention is capable of 
generating image maps of one atomic layer thick patterns, as shown in FIG. 21 A. FIG. 21B 
illustrates the signal strength as the wafer is rotated relative to the probe, thus passing over 



-15- 



011.1179188.1 



Atty. Dkt. No.: 023228-0109 



defects and features of the wafer surface. As shown in FIG. 21C, the present invention, in fact, 
detected sputtered gold at densities less than a single atomic layer. 
[0079] The images generated by the scanning procedure of FIG. 9 were subsequently 
processed to automatically locate defects; thus locating areas of high variability. An ideal 
surface would exhibit a flat signal, but a wafer surface with defects would exhibit some 
variability in the signal. To locate areas with defects, the data was broken up in to small areas of 
known location. The standard deviation of the signal within these areas was determined. Areas 
with defects showed a higher standard deviation, and these results can be seen in FIG. 6. Areas 
with defects appear brighter than lower variability areas of the wafer 15. This is one of many 
possible methods in accordance with the principles of the present invention to process the sensor 
data. 

[0080] More generally, a defect can be identified by one or more of the following: 

• Process the data to look for a voltage or change in voltage (or pattern of voltages 
or changes in voltages) that exceeds some user-defined value (threshold). 

• Compare the data to a known pattern that represents a defect via some form of 
correlation or template matching. 

• Convert the spatial data to the frequency domain and then identify peaks in the 
frequency domain that represent defects with unique spatial characteristics. 

[0081] These techniques can also be combined with other techniques to yield analytical results. 
The signal may also be preprocessed to facilitate defect detection, such as, for example: 

• Since the signal is differential, it can be integrated over some distance to produce 
voltages that represent relative CPD's over the surface of the wafer 15. 

• If the wafer 1 5 is "patterned", then this known pattern can be removed from the 
data prior to processing. This would likely be accomplished through some 
conventional method of variation of image or signal subtraction in either the space 
or frequency domains. 

• The signal would likely be processed with some form of frequency filtering to 
remove high or low frequencies depending on the size, shape and other 
characteristics of the expected defects. 
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• The signal could be processed to remove features of a certain size by doing what 
is called "morphological processing" which is well known in the art. 

[0082] In one embodiment, a defect is detected and the contamination level is quantified based 
on an edge detection algorithm, such as but not limited to Canny Edge detection algorithm. 
Multiple resolutions may be used or multiple scales or a combination thereof. FIGS. 22B-F 
depict the edge detection at various resolutions and in comparison to an optical image (FIG. 
22A). FIGS. 23B-F depict edge detection at various scales and in comparison to an optical 
image (FIG. 23 A). In a preferred example of such an embodiment, the contamination is detected 
and quantified using the steps of: 

• Generating a CPD sensor peak signal at the boundary between two different areas 
(The peak signals behave much like the "edges" , an image processing term. So, 
the contaminated area can be located by edge detection.); 

• Apply an Edge detection algorithm (such as the 2D Canny algorithm); 

• Multiple resolutions with different thresholds (thereby enabling detection of 
various size of contaminants, i.e. the higher resolution (lower threshold) will find 
the smaller contaminants); 

• Quantifying contamination level (CL) by the edge area over the total wafer area in 
the simplest way. 

[0083] As previously discussed, determining a reference point for the sensor is necessary for 
optimal results. In one embodiment, the reference point is at the center of rotation (in the X-Y 
plane) and at the height of the surface of the wafer (on the Z axis). To find this point, the center 
of rotation and the height of the surface of the wafer must be determined, and then the height 
sensor is correlated with the Z position of the nvCPD sensor. 

[0084] To find the center of rotation, the nvCPD sensor and motion system are used to find a 
geometrical or chemical feature on the surface of the spinning wafer at three or more points. 
Since the wafer is spinning, the feature describes a circle. The center of the circle is the center of 
rotation. Given the coordinates of three distinct points (A(xi,yi), B(x 2 ,y2), and C (x,y)) on the 
diameter of the defined circle on the circle, its center is found algebraically by the equation: 

(x -x0(x -x 2 ) + (y -yi)(y -y 2 ) = 0. 
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Due to slight measurement errors, a different set of points might yield slightly different center 
coordinates. The "true" center of rotation is deemed to be the locus (average) of these points. 
[0085] In one embodiment, to find the height of the surface of the wafer without touching the 
wafer surface, two sensors, the nvCPD sensor and a height sensor (which could itself be an 
nvCPD sensor in an embodiment discussed below) can be used. The nvCPD sensor and height 
sensor are calibrated so that when a reading is taken with the height sensor, the Z-axis coordinate 
of the tip of the nvCPD sensor is ascertained. (This calibration procedure is described below.) At 
that point, the readings of the height sensor are correlated with the Z position of the nvCPD 
sensor. Thereafter, the height sensor is used to detect the position of the surface of the wafer 
without touching it and then the tip of the nvCPD sensor positioned accordingly. 
[0086] In one embodiment, the height sensor is correlated with the Z position of the nvCPD 
sensor based on two assumptions: first, that within its usable range, measurements from the 
height sensor are linear in the Z axis and that a constant, k, can map changes in height 
measurements to proportional changes in Z; and second, that the relative positions of the height 
sensor and nvCPD sensor are fixed, i.e. the two sensors can move relative to the rest of the world 
but only as a unit; they, therefore, cannot move independently. Based on these assumptions, a 
point, P, is picked in the X-Y plane where calibration is to be performed. The height sensor is 
positioned above P, and a measurement from the height sensor, Hm, correlated with a coordinate 
on the Z axis, Zh. Next the nvCPD sensor is positioned above P and move it down until it 
touches at a point, Zc. The nvCPD signal changes significantly when the sensor tip touches the 
surface. Once these values are known, the Z value of the point where the tip of the nvCPD 
sensor would touch the surface is derived with the following equation: 

Zsurface = Z curTen t + Z c - Zh + (H m - H curren t) / k 

wherein: 

Zsurface is the height of the surface where the tip of the nvCPD sensor would touch 

Zcurrent is the current height of the sensor 

Hcurrent is the current height sensor measurement 
[0087] As previously mentioned, the height of the sensor should be measured and controlled to 
produce repeatable results. It is also possible to use an nvCPD sensor to control the height in a 
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semiconductor wafer inspection system in accordance with the principles of the present 
invention. In order to use the nvCPD sensor to control height, the system must provide the 
capability to apply a time-varying bias voltage between the probe tip and the wafer surface. As 
the bias voltage varies, it produces an output signal that is a function of the capacitance between 
the probe tip and the wafer surface. The closer the probe tip is to the surface, the larger the 
output voltage. After the relationship between height and capacitance is determined, the 
magnitude of the output signal can be used to calculate the height of the sensor. The signal 
magnitude can be calculated as the peak-to-peak, standard deviation, RMS, or some other 
measure known in the art. 

[0088] Again, the formula for the output of the nvCPD sensor is: 
dt dt 

The voltage V is the contact potential difference between the probe tip and the wafer surface. If 
a bias voltage is applied, the formula then becomes: 

dt dt 

where Vb is the bias voltage. If the nvCPD sensor is not moving relative to the surface of the 
wafer (or is moving relatively slowly), then the capacitance C and the contact potential 
difference voltage V are not changing, and the equation becomes: 

dt 

[0089] Since the bias voltage is a known fixed frequency and magnitude, the output current is a 
function of the capacitance (C). C is a combination of the capacitance between the probe tip and 
wafer surface, and any stray capacitances in the circuit. The capacitance vs. height function can 
be characterized and used to determine the height of the nvCPD probe at a point above the wafer 
surface. Once the height of the sensor is determined, then the bias voltage can be turned off in 
order to make scanning nvCPD measurements. 

[0090] However, in some embodiments prior to scanning a portion of the wafer, a height 
profile is established with a height sensor and then the scanning height of the nvCPD sensor 
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adjusted appropriately. Figure 18 depicts one embodiment which utilizes a height profile of the 
wafer to position the sensor. The height profile is determined by first moving the height sensor 
to the center of rotation and then, with the wafer spinning, the height sensor is moved out toward 
the edge of the wafer until it senses the edge. Note that this also allows the diameter of the wafer 
to be determined. The sensor is then moved back toward the center until it is within the wafer 
flat(s) or notch. One or more height measurements taken along the way establish the profile. An 
appropriate height for nvCPD sensor scanning is calculated based on the profile, particularly 
based on the maximum detected height. 

[0091] As mentioned above, often nvCPD sensors used in accordance with the principles of the 
present invention generate a peak signal that behaves like noise. In accordance with the 
principles of the present invention, denoising algorithms can be applied to both nvCPD signals 
and nvCPD images. In one embodiment, the nvCPD signal/image data are decomposed into the 
wavelet domain using one of the wavelets available such as but not limited to 'Coiflet 5 , 
c Daubechies', 'Symmlet', and other such wavelets. Then, as a result of the wavelet 
decomposition, a series of wavelet coefficients are obtained at a finite number of scales that can 
be given by the user. A coefficient at a particular scale represents the magnitude of the 
frequency corresponding to that scale at the point corresponding to that coefficient. The nvCPD 
signal/image can then be reconstructed by the coefficients in reverse order. 
[0092] By adjusting the coefficients and performing reconstruction, the three components 
(peak, low frequency, and noise) of the nvCPD signal/image can be selectively filtered out. To 
eject the low frequency component from the nvCPD signal/image, only wavelet coefficients at 
fine scales are used for reconstruction since the low frequency component of the nvCPD 
signal/image are represented by the coefficients at coarse scales. To eject the noise from the 
nvCPD signal/image, the coefficients at fine scales can be shrunk softly based on the threshold 
given. The threshold can be determined using any one of numerous methods known in the art 
such as, but not limited to, 'Visu', 'SURE', 'Hybrid', 'MinMax'. The sharp peak signal that is 
related to contamination on the wafer can be reconstructed substantially in isolation by the 
wavelet coefficients resulting after the two processes above. Thus, noise such as vibrations or a 
wobbling of the wafer can be filtered out of the signal. FIG. 19 depicts an image produced by a 
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system in accordance with the principles of the present invention without deconvoluting or 
denoising the data. FIG. 20 illustrates the improved resolution and definition of an image which 
is denoised in accordance with the principles of the preferred embodiment. 
[0093] A semiconductor wafer inspection system in accordance with the principles of the 
present invention which utilizes a nvCPD sensor may, as discussed above, experience a time 
delay. However, the present invention provides a filtering technique to remove this time delay. 
First, the time delay circuit is modeled as a first order RC circuit. The continuous-time transfer 
function of the RC circuit is given by 

Y(s) = 1 

X(S) TS + l 

where X(s) and Y(s) are the Laplace transformation of the input current signal at the probe tip and 

the output voltage measurement to the data acquisition, and x is the time delay constant. 

[0094] The continuous current signal is fed into and amplified by the amplifier, and then 

converted into a discrete signal through the A/D converter. In this way, the collected data by the 

computer at the final stage is a series of discrete data. For digital signal processing, the 

continuous-time transfer function of the RC circuit is converted into a discrete-time transfer 

function based on Z-transformation. This discretized transfer function has the form 

Y(z) __ a 
X(z)~ z + p 

wherein the constants a and f5 are determined by the discretization method employed, the 
sampling time and the time delay constant, t. 

[0095] Next, in a preferred embodiment, the impulse response of the discretized transfer 
function is determined. In general, the impulse response is a finite number of positive discrete 
values that converges to zero gradually. Once the impulse response is found, the deconvolution 
process with the impulse response is performed on each track data separately. 
[0096] Time constant prediction is important and can be assessed by comparing the positive 
peak height and the negative peak height. Figure 16 shows a typical nvCPD signal where there 
is a pair of a positive peak and a negative peak. It is shown that the positive peak is higher than 
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the negative peak. With zero time delay, the signal would look like Figure 17, where the positive 
peak height is equivalent to the negative peak height. 

[0097] By comparing the positive peak height with negative peak height, the time constant can 
be estimated correctly. If the time constant is underestimated, the former peak (in this example, 
the positive peak) is higher than the latter peak (in this example, the negative peak). If the time 
constant is overestimated, the former peak is lower than the latter peak. By varying the time 
constant, a point when the positive and negative peaks are equivalent in height could be found to 
predict the time constant correctly. 

[0098] The following non-limiting example describes methods of preparation of test wafers 
and sensing characteristic images for identifying certain defect states, chemical states, 
electrostatic states and mechanical features present on a semiconductor wafer surface. 

EXAMPLE 

[0099] Sample wafers can be created by dip coating the wafer 15 in solutions that contain 
known concentrations of contaminants. Part of this example describes metal contaminants such 
as Cu and Fe, although any manner of chemical contaminants can be evaluated in this way. The 
wafer 15 described is either a 100 mm or 150 mm wafer, although these examples apply to any 
size wafer. The wafer surface 1 6 is prepared by dipping in HF to remove oxides. The wafer 1 5 
is then cleaned and partially dipped in the metal contaminant solution. The amount of solution 
remaining on the wafer 15, and the resulting concentration of contaminant on the wafer surface 
16, is controlled by selecting dip coating parameters such as the extraction rate. 
[0100] Partial dipping of the test wafer 15 is preferred to create a transition from clean to 
contaminated areas. Because the nvCPD signal is differential, the nvCPD sensor 12 detects 
changes on the wafer surface 16, as opposed to an absolute value relating to surface condition. 
This aspect of nvCPD sensors 12 is offset by the ability to rapidly image and detect localized 
contamination anywhere on the surface of the wafer 15. 

[0101] After preparation, each test wafer 15 can be, if necessary, analyzed using an appropriate 
combination of XPS, Auger, and RBS (or other well known surface analysis methods) 
techniques to measure actual contaminant concentrations in the dipped areas of the wafer 15. 
Each step involved in the sample wafer preparation process is shown in FIG. 7. In a production 
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line methodology, standards can be established correlating measure actual contamination 
concentration to nvCPD data for routine use. 

[0102] After each sample wafer 15 is created, it can be imaged using a radially scanning 
nvCPD imaging system 10 constructed in accordance with the invention. As described before, 
FIGS. 8 A and 8B show basic forms of the nvCPD imaging system 10, and FIG. 9 shows another 
flow diagram illustration of wafer processing. The system 10 employs the nvCPD sensor 12 
mounted on the previously described three-axis positioning system 26. This positioning system 
26 is used to position the nvCPD sensor 12 above the wafer surface 16 to be imaged, and to scan 
the nvCPD sensor 12 radially across the wafer surface. The wafer 15 is mounted on a spindle 
that rotates at high speed (1800 rpm) beneath the nvCPD sensor 12. The system 10 operates by 
acquiring multiple consecutive tracks of data as the nvCPD sensor 12 is stepped along the radius 
of rotation of the wafer 15. 

[0103] The imaging system 10 has been used for a variety of surface analysis experiments. 
FIGS. 10A, 10B, 1 1 A, and 1 IB show sample wafer images that were generated using the nvCPD 
sensor 12 imaging for wafer inspection. The images show optical images in FIG. 10A and 1 1 A 
and nvCPD images in FIG. 10B and 1 IB of a 100mm form of the wafers 15. The first wafer 15 
was cleaned, and then a small vacuum pick-up device was attached to the surface of the wafer 15 
in three locations. The optical image of FIG. 10A shows no evidence of any change on the 
surface 16 of the wafer 15. The nvCPD image of FIG. 10B shows a very large signal at the 
locations where the pick-up device was applied. The nvCPD signal is believed to be the result of 
a small amount of residue left on the surface 16 by the pick-up device. 

[0104] The second set of images in FIGS 1 1 A and 1 IB show a wafer 15 that has had alcohol 
spun-on and then dried. The resulting residue is not visible in the optical image FIG 1 1 A, but is 
clearly visible in the nvCPD image FIG. 1 IB. These images provide a clear demonstration of the 
usefulness of nvCPD sensor 12 for wafer inspection. Through careful measure of a full rnage of 
defect states and chemical constituents it is possible to correlate an image with a particular 
chemical state, defect, or combination thereof. 

[0105] Figures 12A and 12B show, respectively, an optical image of latex glove marks and a 
nvCPD image of latex glove marks. Figures 13A and 13B show, respectively, an optical image 
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of human fingerprints and an nvCPD image of the fingerprints. FIG. 14 shows a nvCPD image 
of a wafer 15 after brushing the wafer 15 with a stainless steel tool, and FIG. 15 shows a nvCPD 
image of the wafer 15 after pressing an aluminum fixture onto the wafer surface 16. All these 
example images were acquired using the nvCPD sensor 12 with the probe sensor tip 14 having a 
diameter of approximately 60 microns measured over a period of approximately 30 seconds. 
[0106] While preferred embodiments of the invention have been shown and described, it will 
be clear to those skilled in the art that various changes and modifications can be made without 
departing from the invention in its broader aspects as set forth in the claims provided hereinafter. 
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